Overview
Dataset statistics
| | Train | Test |
|---|---|---|
| Number of variables | 10 | 9 |
| Number of observations | 90615 | 60411 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Duplicate rows | 0 | 0 |
| Duplicate rows (%) | 0.0% | 0.0% |
| Total size in memory | 6.9 MiB | 4.1 MiB |
| Average record size in memory | 80.0 B | 72.0 B |
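The two memory rows are consistent with each other: total size is approximately the number of observations times the average record size. A quick check, as a sketch that hard-codes the values from the table above (the variable and function names are illustrative):

```python
# Values copied from the dataset statistics table.
MIB = 1024 ** 2  # bytes per mebibyte

datasets = {
    "Train": {"rows": 90615, "bytes_per_row": 80.0},
    "Test": {"rows": 60411, "bytes_per_row": 72.0},
}

def total_mib(rows: int, bytes_per_row: float) -> float:
    """Approximate in-memory size in MiB from row count and average record size."""
    return rows * bytes_per_row / MIB

for name, d in datasets.items():
    print(f"{name}: {total_mib(d['rows'], d['bytes_per_row']):.1f} MiB")
```

Rounded to one decimal, both results agree with the "Total size in memory" row.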
Variable types
| | Train | Test |
|---|---|---|
| Numeric | 9 | 8 |
| Categorical | 1 | 1 |
Alerts
| Train | Test | Alert type |
|---|---|---|
| Diameter is highly overall correlated with Height and 6 other fields | Diameter is highly overall correlated with Height and 5 other fields | High correlation |
| Height is highly overall correlated with Diameter and 6 other fields | Height is highly overall correlated with Diameter and 5 other fields | High correlation |
| Length is highly overall correlated with Diameter and 6 other fields | Length is highly overall correlated with Diameter and 5 other fields | High correlation |
| Rings is highly overall correlated with Diameter and 6 other fields | Alert not present in this dataset | High correlation |
| Shell weight is highly overall correlated with Diameter and 6 other fields | Shell weight is highly overall correlated with Diameter and 5 other fields | High correlation |
| Whole weight is highly overall correlated with Diameter and 6 other fields | Whole weight is highly overall correlated with Diameter and 5 other fields | High correlation |
| Whole weight.1 is highly overall correlated with Diameter and 6 other fields | Whole weight.1 is highly overall correlated with Diameter and 5 other fields | High correlation |
| Whole weight.2 is highly overall correlated with Diameter and 6 other fields | Whole weight.2 is highly overall correlated with Diameter and 5 other fields | High correlation |
| id is uniformly distributed | id is uniformly distributed | Uniform |
| id has unique values | id has unique values | Unique |
Reproduction
| | Train | Test |
|---|---|---|
| Analysis started | 2025-05-18 18:13:33.496160 | 2025-05-18 18:13:42.916565 |
| Analysis finished | 2025-05-18 18:13:39.088853 | 2025-05-18 18:13:47.472279 |
| Duration | 5.59 seconds | 4.56 seconds |
| Software version | ydata-profiling v4.16.1 | ydata-profiling v4.16.1 |
Variables
id
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 90615 | 60411 |
| Distinct (%) | 100.0% | 100.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 45307 | 120820 |
| | Train | Test |
|---|---|---|
| Minimum | 0 | 90615 |
| Maximum | 90614 | 151025 |
| Zeros | 1 | 0 |
| Zeros (%) | < 0.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0 | 90615 |
| 5-th percentile | 4530.7 | 93635.5 |
| Q1 | 22653.5 | 105717.5 |
| median | 45307 | 120820 |
| Q3 | 67960.5 | 135922.5 |
| 95-th percentile | 86083.3 | 148004.5 |
| Maximum | 90614 | 151025 |
| Range | 90614 | 60410 |
| Interquartile range (IQR) | 45307 | 30205 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 26158.442 | 17439.298 |
| Coefficient of variation (CV) | 0.57735983 | 0.14434115 |
| Kurtosis | -1.2 | -1.2 |
| Mean | 45307 | 120820 |
| Median Absolute Deviation (MAD) | 22654 | 15103 |
| Skewness | 0 | 0 |
| Sum | 4.1054938 × 10⁹ | 7.298857 × 10⁹ |
| Variance | 6.8426407 × 10⁸ | 3.0412911 × 10⁸ |
| Monotonicity | Strictly increasing | Strictly increasing |
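Because id is a run of consecutive integers (0 to 90614 in Train, 90615 to 151025 in Test), its statistics follow from closed-form expressions for an arithmetic sequence. The sketch below reproduces a few of the table values. It assumes the sample variance (divisor n - 1), which is consistent with the reported figures, and the helper name is hypothetical.

```python
import math

def consecutive_int_stats(first: int, last: int) -> dict:
    """Closed-form statistics for the integers first, first + 1, ..., last."""
    n = last - first + 1
    mean = (first + last) / 2
    total = n * (first + last) // 2   # sum of an arithmetic sequence (always exact)
    variance = n * (n + 1) / 12       # sample variance (divisor n - 1)
    std = math.sqrt(variance)
    return {"n": n, "mean": mean, "sum": total,
            "variance": variance, "std": std, "cv": std / mean}

train = consecutive_int_stats(0, 90614)
test = consecutive_int_stats(90615, 151025)

for name, s in (("Train", train), ("Test", test)):
    print(f"{name}: n={s['n']} mean={s['mean']} sum={s['sum']} "
          f"std={s['std']:.3f} cv={s['cv']:.6f}")
```

The same reasoning explains why the two datasets share a skewness of 0 and a kurtosis of about -1.2: both are evenly spaced sequences, which behave like a uniform distribution.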
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 90614 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 90598 | 1 | < 0.1% |
| 90597 | 1 | < 0.1% |
| 90596 | 1 | < 0.1% |
| Other values (90605) | 90605 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 151025 | 1 | < 0.1% |
| 90615 | 1 | < 0.1% |
| 90616 | 1 | < 0.1% |
| 90617 | 1 | < 0.1% |
| 90618 | 1 | < 0.1% |
| 90619 | 1 | < 0.1% |
| 90620 | 1 | < 0.1% |
| 90621 | 1 | < 0.1% |
| 90622 | 1 | < 0.1% |
| 90623 | 1 | < 0.1% |
| Other values (60401) | 60401 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 90615 | 1 | |
| 90616 | 1 | |
| 90617 | 1 | |
| 90618 | 1 | |
| 90619 | 1 | |
| 90620 | 1 | |
| 90621 | 1 | |
| 90622 | 1 | |
| 90623 | 1 | |
| 90624 | 1 |
Sex
Categorical
| | Train | Test |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Categories (Train and Test): I, M, F
Length
| | Train | Test |
|---|---|---|
| Max length | 1 | 1 |
| Median length | 1 | 1 |
| Mean length | 1 | 1 |
| Min length | 1 | 1 |
Unique
| | Train | Test |
|---|---|---|
| Unique | 0 | 0 |
| Unique (%) | 0.0% | 0.0% |
Sample
| | Train | Test |
|---|---|---|
| 1st row | F | M |
| 2nd row | F | M |
| 3rd row | I | M |
| 4th row | M | M |
| 5th row | I | I |
Common Values
Train
| Value | Count | Frequency (%) |
|---|---|---|
| I | 33093 | 36.5% |
| M | 31027 | 34.2% |
| F | 26495 | 29.2% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| I | 22241 | 36.8% |
| M | 20783 | 34.4% |
| F | 17387 | 28.8% |
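The frequency column follows from the counts: each count divided by the number of observations in that dataset. A minimal sketch using the counts above (the dictionary and function names are illustrative):

```python
# Category counts copied from the Common Values tables.
counts = {
    "Train": {"I": 33093, "M": 31027, "F": 26495},
    "Test": {"I": 22241, "M": 20783, "F": 17387},
}

def frequencies(category_counts: dict) -> dict:
    """Return each category's share of the total, in percent."""
    total = sum(category_counts.values())
    return {k: 100 * v / total for k, v in category_counts.items()}

for name, c in counts.items():
    shares = ", ".join(f"{k}: {p:.1f}%" for k, p in frequencies(c).items())
    print(f"{name} ({sum(c.values())} rows): {shares}")
```

The totals also serve as a sanity check, since they equal the number of observations reported in the overview.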
Length: histogram of lengths of the category (plot not reproduced).
Common Values (Plot), Train and Test: plots not reproduced. The legend counts are identical to the Common Values tables (I: 33093, M: 31027, F: 26495 for Train; I: 22241, M: 20783, F: 17387 for Test).
Most occurring characters
Train
| Value | Count | Frequency (%) |
|---|---|---|
| I | 33093 | 36.5% |
| M | 31027 | 34.2% |
| F | 26495 | 29.2% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| I | 22241 | 36.8% |
| M | 20783 | 34.4% |
| F | 17387 | 28.8% |
Most occurring categories
Train
| Value | Count | Frequency (%) |
|---|---|---|
| Uppercase Letter | 90615 | 100.0% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| Uppercase Letter | 60411 | 100.0% |
Most frequent character per category
Uppercase Letter
Train
| Value | Count | Frequency (%) |
|---|---|---|
| I | 33093 | 36.5% |
| M | 31027 | 34.2% |
| F | 26495 | 29.2% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| I | 22241 | 36.8% |
| M | 20783 | 34.4% |
| F | 17387 | 28.8% |
Most occurring scripts
Train
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 90615 | 100.0% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 60411 | 100.0% |
Most frequent character per script
Latin
Train
| Value | Count | Frequency (%) |
|---|---|---|
| I | 33093 | 36.5% |
| M | 31027 | 34.2% |
| F | 26495 | 29.2% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| I | 22241 | 36.8% |
| M | 20783 | 34.4% |
| F | 17387 | 28.8% |
Most occurring blocks
Train
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 90615 | 100.0% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 60411 | 100.0% |
Most frequent character per block
ASCII
Train
| Value | Count | Frequency (%) |
|---|---|---|
| I | 33093 | 36.5% |
| M | 31027 | 34.2% |
| F | 26495 | 29.2% |
Test
| Value | Count | Frequency (%) |
|---|---|---|
| I | 22241 | 36.8% |
| M | 20783 | 34.4% |
| F | 17387 | 28.8% |
Length
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 157 | 148 |
| Distinct (%) | 0.2% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.51709842 | 0.51742818 |
| | Train | Test |
|---|---|---|
| Minimum | 0.075 | 0.075 |
| Maximum | 0.815 | 0.8 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0.075 | 0.075 |
| 5-th percentile | 0.28 | 0.28 |
| Q1 | 0.445 | 0.45 |
| median | 0.545 | 0.545 |
| Q3 | 0.6 | 0.6 |
| 95-th percentile | 0.68 | 0.675 |
| Maximum | 0.815 | 0.8 |
| Range | 0.74 | 0.725 |
| Interquartile range (IQR) | 0.155 | 0.15 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.11821671 | 0.1176087 |
| Coefficient of variation (CV) | 0.22861549 | 0.22729474 |
| Kurtosis | 0.1333638 | 0.14178863 |
| Mean | 0.51709842 | 0.51742818 |
| Median Absolute Deviation (MAD) | 0.07 | 0.07 |
| Skewness | -0.73201519 | -0.73456488 |
| Sum | 46856.874 | 31258.354 |
| Variance | 0.01397519 | 0.013831807 |
| Monotonicity | Not monotonic | Not monotonic |
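Several of these figures are derived from one another: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks the Train column for Length against those identities. It uses the rounded values as printed in the report, so tiny discrepancies in the last digit are expected.

```python
# Train values for Length, copied from the report (rounded as printed).
n = 90615
mean = 0.51709842
std = 0.11821671

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance from the standard deviation
total = mean * n       # sum from the mean and the row count

print(f"CV       = {cv:.6f}")
print(f"Variance = {variance:.6f}")
print(f"Sum      = {total:.2f}")
```

The same three identities hold for every numeric variable in the report, which makes them a convenient consistency check when copying figures by hand.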
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.575 | 3267 | 3.6% |
| 0.58 | 2670 | 2.9% |
| 0.57 | 2167 | 2.4% |
| 0.55 | 2122 | 2.3% |
| 0.595 | 1992 | 2.2% |
| 0.525 | 1985 | 2.2% |
| 0.6 | 1961 | 2.2% |
| 0.585 | 1911 | 2.1% |
| 0.53 | 1908 | 2.1% |
| 0.565 | 1906 | 2.1% |
| Other values (147) | 68726 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.575 | 2094 | 3.5% |
| 0.58 | 1706 | 2.8% |
| 0.57 | 1400 | 2.3% |
| 0.6 | 1380 | 2.3% |
| 0.55 | 1346 | 2.2% |
| 0.525 | 1320 | 2.2% |
| 0.595 | 1315 | 2.2% |
| 0.585 | 1301 | 2.2% |
| 0.53 | 1293 | 2.1% |
| 0.565 | 1292 | 2.1% |
| Other values (138) | 45964 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.075 | 4 | < 0.1% |
| 0.09 | 3 | < 0.1% |
| 0.1 | 2 | < 0.1% |
| 0.105 | 1 | < 0.1% |
| 0.11 | 11 | |
| 0.115 | 1 | < 0.1% |
| 0.12 | 2 | < 0.1% |
| 0.125 | 2 | < 0.1% |
| 0.13 | 24 | |
| 0.135 | 16 |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.075 | 3 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.095 | 1 | < 0.1% |
| 0.1 | 3 | < 0.1% |
| 0.11 | 4 | < 0.1% |
| 0.125 | 4 | < 0.1% |
| 0.13 | 19 | |
| 0.135 | 8 | < 0.1% |
| 0.14 | 16 | |
| 0.15 | 21 |
Diameter
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 126 | 130 |
| Distinct (%) | 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.40167916 | 0.40196134 |
| | Train | Test |
|---|---|---|
| Minimum | 0.055 | 0.055 |
| Maximum | 0.65 | 0.65 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0.055 | 0.055 |
| 5-th percentile | 0.21 | 0.21 |
| Q1 | 0.345 | 0.345 |
| median | 0.425 | 0.425 |
| Q3 | 0.47 | 0.47 |
| 95-th percentile | 0.535 | 0.535 |
| Maximum | 0.65 | 0.65 |
| Range | 0.595 | 0.595 |
| Interquartile range (IQR) | 0.125 | 0.125 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.098026319 | 0.097469695 |
| Coefficient of variation (CV) | 0.24404134 | 0.24248525 |
| Kurtosis | 0.00064626408 | 0.0040647768 |
| Mean | 0.40167916 | 0.40196134 |
| Median Absolute Deviation (MAD) | 0.06 | 0.055 |
| Skewness | -0.69523597 | -0.69631151 |
| Sum | 36398.157 | 24282.887 |
| Variance | 0.0096091593 | 0.0095003414 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.45 | 4182 | 4.6% |
| 0.475 | 3307 | 3.6% |
| 0.455 | 2715 | 3.0% |
| 0.4 | 2667 | 2.9% |
| 0.47 | 2441 | 2.7% |
| 0.465 | 2272 | 2.5% |
| 0.46 | 2259 | 2.5% |
| 0.425 | 2227 | 2.5% |
| 0.44 | 2182 | 2.4% |
| 0.435 | 2131 | 2.4% |
| Other values (116) | 64232 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.45 | 2682 | 4.4% |
| 0.475 | 2206 | 3.7% |
| 0.455 | 1779 | 2.9% |
| 0.4 | 1715 | 2.8% |
| 0.47 | 1668 | 2.8% |
| 0.465 | 1537 | 2.5% |
| 0.46 | 1535 | 2.5% |
| 0.44 | 1507 | 2.5% |
| 0.425 | 1492 | 2.5% |
| 0.435 | 1396 | 2.3% |
| Other values (120) | 42894 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.055 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.065 | 1 | < 0.1% |
| 0.075 | 1 | < 0.1% |
| 0.085 | 2 | < 0.1% |
| 0.09 | 12 | < 0.1% |
| 0.095 | 4 | < 0.1% |
| 0.1 | 20 | < 0.1% |
| 0.103 | 1 | < 0.1% |
| 0.105 | 73 |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.055 | 3 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.075 | 1 | < 0.1% |
| 0.08 | 2 | < 0.1% |
| 0.085 | 1 | < 0.1% |
| 0.09 | 7 | < 0.1% |
| 0.095 | 3 | < 0.1% |
| 0.1 | 18 | < 0.1% |
| 0.105 | 51 | |
| 0.11 | 51 |
Height
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 90 | 85 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.13546406 | 0.13575108 |
| | Train | Test |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 1.13 | 1.095 |
| Zeros | 6 | 2 |
| Zeros (%) | < 0.1% | < 0.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0.07 | 0.07 |
| Q1 | 0.11 | 0.11 |
| median | 0.14 | 0.14 |
| Q3 | 0.16 | 0.16 |
| 95-th percentile | 0.195 | 0.195 |
| Maximum | 1.13 | 1.095 |
| Range | 1.13 | 1.095 |
| Interquartile range (IQR) | 0.05 | 0.05 |
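The quantiles show how far the extremes of Height sit from the bulk of the data: the middle 50% spans only 0.11 to 0.16, while the Train maximum is 1.13 and the minimum is 0. One common way to quantify this is Tukey's rule, with fences at 1.5 × IQR beyond the quartiles. This rule is an illustration added here, not something the report itself computes. A sketch using the Train quantiles:

```python
# Train quantiles for Height, copied from the table above.
q1, q3 = 0.11, 0.16
minimum, maximum = 0.0, 1.13

def tukey_fences(q1: float, q3: float, k: float = 1.5) -> tuple:
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

lower, upper = tukey_fences(q1, q3)
print(f"fences: [{lower:.3f}, {upper:.3f}]")
print("minimum below lower fence:", minimum < lower)
print("maximum above upper fence:", maximum > upper)
```

Both extremes fall outside the fences, which fits the high kurtosis reported for Height in the descriptive statistics.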
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.038007562 | 0.038174758 |
| Coefficient of variation (CV) | 0.28057304 | 0.28121145 |
| Kurtosis | 13.454051 | 17.693333 |
| Mean | 0.13546406 | 0.13575108 |
| Median Absolute Deviation (MAD) | 0.025 | 0.025 |
| Skewness | 0.30997506 | 0.55450613 |
| Sum | 12275.075 | 8200.8585 |
| Variance | 0.0014445748 | 0.0014573121 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.15 | 5742 | 6.3% |
| 0.14 | 5415 | 6.0% |
| 0.155 | 5230 | 5.8% |
| 0.145 | 5048 | 5.6% |
| 0.135 | 4980 | 5.5% |
| 0.125 | 4478 | 4.9% |
| 0.175 | 4174 | 4.6% |
| 0.16 | 3946 | 4.4% |
| 0.165 | 3772 | 4.2% |
| 0.13 | 3603 | 4.0% |
| Other values (80) | 44227 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.15 | 3893 | 6.4% |
| 0.14 | 3625 | 6.0% |
| 0.155 | 3433 | 5.7% |
| 0.135 | 3322 | 5.5% |
| 0.145 | 3291 | 5.4% |
| 0.125 | 3005 | 5.0% |
| 0.16 | 2676 | 4.4% |
| 0.175 | 2636 | 4.4% |
| 0.165 | 2525 | 4.2% |
| 0.13 | 2462 | 4.1% |
| Other values (75) | 29543 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| 0.005 | 3 | < 0.1% |
| 0.009 | 1 | < 0.1% |
| 0.01 | 4 | < 0.1% |
| 0.015 | 16 | < 0.1% |
| 0.019 | 1 | < 0.1% |
| 0.02 | 29 | < 0.1% |
| 0.025 | 100 | |
| 0.03 | 129 |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2 | < 0.1% |
| 0.005 | 2 | < 0.1% |
| 0.01 | 3 | < 0.1% |
| 0.0105 | 1 | < 0.1% |
| 0.015 | 6 | < 0.1% |
| 0.02 | 18 | < 0.1% |
| 0.025 | 62 | 0.1% |
| 0.03 | 83 | 0.1% |
| 0.035 | 113 | |
| 0.04 | 234 |
Whole weight
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 3175 | 3037 |
| Distinct (%) | 3.5% | 5.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.78903495 | 0.7900623 |
| | Train | Test |
|---|---|---|
| Minimum | 0.002 | 0.002 |
| Maximum | 2.8255 | 2.8255 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0.002 | 0.002 |
| 5-th percentile | 0.1075 | 0.1075 |
| Q1 | 0.419 | 0.4195 |
| median | 0.7995 | 0.8015 |
| Q3 | 1.0675 | 1.07 |
| 95-th percentile | 1.6185 | 1.6185 |
| Maximum | 2.8255 | 2.8255 |
| Range | 2.8235 | 2.8235 |
| Interquartile range (IQR) | 0.6485 | 0.6505 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.4576707 | 0.45759058 |
| Coefficient of variation (CV) | 0.58003856 | 0.5791829 |
| Kurtosis | -0.18513558 | -0.16542642 |
| Mean | 0.78903495 | 0.7900623 |
| Median Absolute Deviation (MAD) | 0.322 | 0.322 |
| Skewness | 0.42931626 | 0.43566352 |
| Sum | 71498.402 | 47728.454 |
| Variance | 0.20946247 | 0.20938914 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.5805 | 485 | 0.5% |
| 0.7665 | 477 | 0.5% |
| 0.974 | 361 | 0.4% |
| 0.2225 | 347 | 0.4% |
| 0.8295 | 325 | 0.4% |
| 0.874 | 318 | 0.4% |
| 0.6355 | 318 | 0.4% |
| 1.4385 | 310 | 0.3% |
| 0.879 | 306 | 0.3% |
| 1.1345 | 301 | 0.3% |
| Other values (3165) | 87067 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.5805 | 329 | 0.5% |
| 0.7665 | 313 | 0.5% |
| 0.8295 | 256 | 0.4% |
| 0.974 | 254 | 0.4% |
| 1.0265 | 225 | 0.4% |
| 0.6355 | 221 | 0.4% |
| 0.874 | 216 | 0.4% |
| 0.2225 | 205 | 0.3% |
| 0.9685 | 205 | 0.3% |
| 0.873 | 202 | 0.3% |
| Other values (3027) | 57985 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.002 | 2 | < 0.1% |
| 0.005 | 2 | < 0.1% |
| 0.0055 | 2 | < 0.1% |
| 0.0065 | 1 | < 0.1% |
| 0.008 | 6 | < 0.1% |
| 0.0095 | 1 | < 0.1% |
| 0.0105 | 34 | |
| 0.011 | 4 | < 0.1% |
| 0.0115 | 2 | < 0.1% |
| 0.012 | 2 | < 0.1% |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.002 | 2 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| 0.0045 | 1 | < 0.1% |
| 0.0075 | 1 | < 0.1% |
| 0.008 | 9 | |
| 0.0095 | 2 | < 0.1% |
| 0.0105 | 17 | |
| 0.0115 | 2 | < 0.1% |
| 0.012 | 1 | < 0.1% |
| 0.0125 | 4 | < 0.1% |
Whole weight.1
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 1799 | 1747 |
| Distinct (%) | 2.0% | 2.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.34077811 | 0.34122687 |
| | Train | Test |
|---|---|---|
| Minimum | 0.001 | 0.001 |
| Maximum | 1.488 | 1.488 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0.001 | 0.001 |
| 5-th percentile | 0.043 | 0.044 |
| Q1 | 0.1775 | 0.1785 |
| median | 0.33 | 0.329 |
| Q3 | 0.463 | 0.4645 |
| 95-th percentile | 0.7105 | 0.711 |
| Maximum | 1.488 | 1.488 |
| Range | 1.487 | 1.487 |
| Interquartile range (IQR) | 0.2855 | 0.286 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.20442848 | 0.20422071 |
| Coefficient of variation (CV) | 0.59988736 | 0.59848952 |
| Kurtosis | 0.28401194 | 0.29017201 |
| Mean | 0.34077811 | 0.34122687 |
| Median Absolute Deviation (MAD) | 0.1435 | 0.1435 |
| Skewness | 0.59197329 | 0.59320576 |
| Sum | 30879.608 | 20613.856 |
| Variance | 0.041791003 | 0.041706097 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.096 | 403 | 0.4% |
| 0.3485 | 390 | 0.4% |
| 0.2945 | 366 | 0.4% |
| 0.3155 | 365 | 0.4% |
| 0.4935 | 364 | 0.4% |
| 0.3285 | 358 | 0.4% |
| 0.3345 | 345 | 0.4% |
| 0.4035 | 345 | 0.4% |
| 0.5265 | 344 | 0.4% |
| 0.3695 | 343 | 0.4% |
| Other values (1789) | 86992 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.3485 | 280 | 0.5% |
| 0.5305 | 246 | 0.4% |
| 0.4235 | 242 | 0.4% |
| 0.3155 | 239 | 0.4% |
| 0.3695 | 239 | 0.4% |
| 0.3285 | 237 | 0.4% |
| 0.096 | 237 | 0.4% |
| 0.4935 | 234 | 0.4% |
| 0.2945 | 230 | 0.4% |
| 0.3935 | 218 | 0.4% |
| Other values (1737) | 58009 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.001 | 2 | < 0.1% |
| 0.0015 | 1 | < 0.1% |
| 0.002 | 2 | < 0.1% |
| 0.0025 | 9 | < 0.1% |
| 0.003 | 2 | < 0.1% |
| 0.0035 | 9 | < 0.1% |
| 0.004 | 4 | < 0.1% |
| 0.0045 | 29 | |
| 0.005 | 44 | |
| 0.0055 | 31 |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.001 | 2 | < 0.1% |
| 0.0015 | 1 | < 0.1% |
| 0.002 | 1 | < 0.1% |
| 0.0025 | 3 | < 0.1% |
| 0.003 | 2 | < 0.1% |
| 0.004 | 2 | < 0.1% |
| 0.0045 | 15 | |
| 0.005 | 31 | |
| 0.0051 | 1 | < 0.1% |
| 0.0055 | 32 |
Whole weight.2
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 979 | 960 |
| Distinct (%) | 1.1% | 1.6% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.16942184 | 0.16941934 |
| | Train | Test |
|---|---|---|
| Minimum | 0.0005 | 0.0005 |
| Maximum | 0.76 | 0.6415 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0.0005 | 0.0005 |
| 5-th percentile | 0.023 | 0.0235 |
| Q1 | 0.0865 | 0.0865 |
| median | 0.166 | 0.166 |
| Q3 | 0.2325 | 0.2325 |
| 95-th percentile | 0.3555 | 0.3555 |
| Maximum | 0.76 | 0.6415 |
| Range | 0.7595 | 0.641 |
| Interquartile range (IQR) | 0.146 | 0.146 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.10090889 | 0.10072047 |
| Coefficient of variation (CV) | 0.59560731 | 0.59450393 |
| Kurtosis | -0.20372097 | -0.20488298 |
| Mean | 0.16942184 | 0.16941934 |
| Median Absolute Deviation (MAD) | 0.0735 | 0.073 |
| Skewness | 0.47673334 | 0.47612882 |
| Sum | 15352.16 | 10234.792 |
| Variance | 0.010182604 | 0.010144612 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.1715 | 799 | 0.9% |
| 0.1725 | 689 | 0.8% |
| 0.2195 | 611 | 0.7% |
| 0.1625 | 525 | 0.6% |
| 0.2145 | 501 | 0.6% |
| 0.1815 | 500 | 0.6% |
| 0.0265 | 494 | 0.5% |
| 0.1825 | 480 | 0.5% |
| 0.1405 | 474 | 0.5% |
| 0.1905 | 470 | 0.5% |
| Other values (969) | 85072 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.1715 | 540 | 0.9% |
| 0.1725 | 445 | 0.7% |
| 0.2195 | 417 | 0.7% |
| 0.0265 | 375 | 0.6% |
| 0.1625 | 365 | 0.6% |
| 0.2145 | 349 | 0.6% |
| 0.1815 | 324 | 0.5% |
| 0.1405 | 321 | 0.5% |
| 0.1825 | 318 | 0.5% |
| 0.1905 | 303 | 0.5% |
| Other values (950) | 56654 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0005 | 17 | < 0.1% |
| 0.001 | 3 | < 0.1% |
| 0.0015 | 3 | < 0.1% |
| 0.002 | 7 | < 0.1% |
| 0.0025 | 53 | 0.1% |
| 0.003 | 37 | < 0.1% |
| 0.0035 | 83 | |
| 0.004 | 5 | < 0.1% |
| 0.0045 | 106 | |
| 0.005 | 169 |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0005 | 14 | < 0.1% |
| 0.001 | 2 | < 0.1% |
| 0.0015 | 1 | < 0.1% |
| 0.002 | 4 | < 0.1% |
| 0.0025 | 43 | |
| 0.003 | 18 | < 0.1% |
| 0.0035 | 45 | |
| 0.004 | 6 | < 0.1% |
| 0.0045 | 68 | |
| 0.005 | 89 |
Shell weight
Real number (ℝ)
| | Train | Test |
|---|---|---|
| Distinct | 1129 | 1089 |
| Distinct (%) | 1.2% | 1.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.22589784 | 0.22612471 |
| | Train | Test |
|---|---|---|
| Minimum | 0.0015 | 0.0015 |
| Maximum | 1.005 | 1.004 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 708.1 KiB | 472.1 KiB |
Quantile statistics
| | Train | Test |
|---|---|---|
| Minimum | 0.0015 | 0.0015 |
| 5-th percentile | 0.031 | 0.0325 |
| Q1 | 0.12 | 0.12 |
| median | 0.225 | 0.225 |
| Q3 | 0.305 | 0.305 |
| 95-th percentile | 0.46 | 0.46 |
| Maximum | 1.005 | 1.004 |
| Range | 1.0035 | 1.0025 |
| Interquartile range (IQR) | 0.185 | 0.185 |
Descriptive statistics
| | Train | Test |
|---|---|---|
| Standard deviation | 0.13020334 | 0.12982647 |
| Coefficient of variation (CV) | 0.5763815 | 0.5741366 |
| Kurtosis | 0.096048966 | 0.042671196 |
| Mean | 0.22589784 | 0.22612471 |
| Median Absolute Deviation (MAD) | 0.0905 | 0.0915 |
| Skewness | 0.47909249 | 0.46852363 |
| Sum | 20469.733 | 13660.42 |
| Variance | 0.016952909 | 0.016854912 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.24 | 1628 | 1.8% |
| 0.22 | 1269 | 1.4% |
| 0.25 | 1259 | 1.4% |
| 0.275 | 1211 | 1.3% |
| 0.265 | 1200 | 1.3% |
| 0.295 | 1131 | 1.2% |
| 0.27 | 1130 | 1.2% |
| 0.17 | 1094 | 1.2% |
| 0.26 | 1049 | 1.2% |
| 0.285 | 1008 | 1.1% |
| Other values (1119) | 78636 |
Common values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.24 | 1009 | 1.7% |
| 0.22 | 880 | 1.5% |
| 0.265 | 837 | 1.4% |
| 0.25 | 829 | 1.4% |
| 0.17 | 784 | 1.3% |
| 0.26 | 762 | 1.3% |
| 0.275 | 756 | 1.3% |
| 0.27 | 726 | 1.2% |
| 0.295 | 719 | 1.2% |
| 0.28 | 677 | 1.1% |
| Other values (1079) | 52432 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0015 | 4 | < 0.1% |
| 0.0018 | 1 | < 0.1% |
| 0.002 | 1 | < 0.1% |
| 0.0025 | 8 | < 0.1% |
| 0.003 | 14 | < 0.1% |
| 0.0035 | 22 | < 0.1% |
| 0.004 | 20 | < 0.1% |
| 0.0045 | 4 | < 0.1% |
| 0.005 | 299 | |
| 0.0055 | 11 | < 0.1% |
Minimum 10 values (Test)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0015 | 6 | < 0.1% |
| 0.0025 | 4 | < 0.1% |
| 0.003 | 5 | < 0.1% |
| 0.0035 | 7 | < 0.1% |
| 0.004 | 20 | < 0.1% |
| 0.0045 | 3 | < 0.1% |
| 0.005 | 188 | |
| 0.0055 | 8 | < 0.1% |
| 0.006 | 16 | < 0.1% |
| 0.0065 | 13 | < 0.1% |
Rings
Real number (ℝ)
| | Train |
|---|---|
| Distinct | 28 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.6967941 |
| | Train |
|---|---|
| Minimum | 1 |
| Maximum | 29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 708.1 KiB |
Quantile statistics
| | Train |
|---|---|
| Minimum | 1 |
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 9 |
| Q3 | 11 |
| 95-th percentile | 16 |
| Maximum | 29 |
| Range | 28 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| | Train |
|---|---|
| Standard deviation | 3.1762209 |
| Coefficient of variation (CV) | 0.32755372 |
| Kurtosis | 2.6129342 |
| Mean | 9.6967941 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.204273 |
| Sum | 878675 |
| Variance | 10.088379 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=28)
Common values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 17465 | |
| 8 | 14499 | |
| 10 | 12464 | |
| 7 | 9008 | |
| 11 | 8407 | |
| 6 | 5411 | 6.0% |
| 12 | 4719 | 5.2% |
| 13 | 4074 | 4.5% |
| 5 | 2862 | 3.2% |
| 14 | 2507 | 2.8% |
| Other values (18) | 9199 |
Minimum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 25 | < 0.1% |
| 2 | 29 | < 0.1% |
| 3 | 386 | 0.4% |
| 4 | 1402 | 1.5% |
| 5 | 2862 | 3.2% |
| 6 | 5411 | 6.0% |
| 7 | 9008 | |
| 8 | 14499 | |
| 9 | 17465 | |
| 10 | 12464 |
Maximum 10 values (Train)
| Value | Count | Frequency (%) |
|---|---|---|
| 29 | 24 | < 0.1% |
| 27 | 41 | < 0.1% |
| 26 | 18 | < 0.1% |
| 25 | 22 | < 0.1% |
| 24 | 29 | < 0.1% |
| 23 | 180 | 0.2% |
| 22 | 108 | 0.1% |
| 21 | 255 | 0.3% |
| 20 | 507 | |
| 19 | 639 |
Interactions
Pairwise interaction plots for Train and Test are not reproduced here. Where the source report shows "Interaction plot not present for dataset", the plot could not be drawn for one of the two datasets; Rings, for example, exists only in Train.
Correlations
Train
| | Diameter | Height | Length | Rings | Sex | Shell weight | Whole weight | Whole weight.1 | Whole weight.2 | id |
|---|---|---|---|---|---|---|---|---|---|---|
| Diameter | 1.000 | 0.921 | 0.985 | 0.720 | 0.486 | 0.962 | 0.978 | 0.961 | 0.962 | 0.004 |
| Height | 0.921 | 1.000 | 0.916 | 0.757 | 0.434 | 0.941 | 0.936 | 0.901 | 0.924 | 0.005 |
| Length | 0.985 | 0.916 | 1.000 | 0.708 | 0.480 | 0.956 | 0.976 | 0.964 | 0.961 | 0.005 |
| Rings | 0.720 | 0.757 | 0.708 | 1.000 | 0.410 | 0.787 | 0.736 | 0.662 | 0.724 | 0.003 |
| Sex | 0.486 | 0.434 | 0.480 | 0.410 | 1.000 | 0.496 | 0.496 | 0.470 | 0.496 | 0.008 |
| Shell weight | 0.962 | 0.941 | 0.956 | 0.787 | 0.496 | 1.000 | 0.974 | 0.934 | 0.955 | 0.005 |
| Whole weight | 0.978 | 0.936 | 0.976 | 0.736 | 0.496 | 0.974 | 1.000 | 0.977 | 0.980 | 0.005 |
| Whole weight.1 | 0.961 | 0.901 | 0.964 | 0.662 | 0.470 | 0.934 | 0.977 | 1.000 | 0.959 | 0.003 |
| Whole weight.2 | 0.962 | 0.924 | 0.961 | 0.724 | 0.496 | 0.955 | 0.980 | 0.959 | 1.000 | 0.004 |
| id | 0.004 | 0.005 | 0.005 | 0.003 | 0.008 | 0.005 | 0.005 | 0.003 | 0.004 | 1.000 |
Test
| | Diameter | Height | Length | Sex | Shell weight | Whole weight | Whole weight.1 | Whole weight.2 | id |
|---|---|---|---|---|---|---|---|---|---|
| Diameter | 1.000 | 0.920 | 0.985 | 0.484 | 0.961 | 0.977 | 0.961 | 0.961 | 0.010 |
| Height | 0.920 | 1.000 | 0.915 | 0.403 | 0.941 | 0.936 | 0.901 | 0.924 | 0.007 |
| Length | 0.985 | 0.915 | 1.000 | 0.477 | 0.955 | 0.976 | 0.964 | 0.961 | 0.009 |
| Sex | 0.484 | 0.403 | 0.477 | 1.000 | 0.493 | 0.493 | 0.468 | 0.495 | 0.000 |
| Shell weight | 0.961 | 0.941 | 0.955 | 0.493 | 1.000 | 0.974 | 0.934 | 0.955 | 0.008 |
| Whole weight | 0.977 | 0.936 | 0.976 | 0.493 | 0.974 | 1.000 | 0.977 | 0.980 | 0.009 |
| Whole weight.1 | 0.961 | 0.901 | 0.964 | 0.468 | 0.934 | 0.977 | 1.000 | 0.960 | 0.009 |
| Whole weight.2 | 0.961 | 0.924 | 0.961 | 0.495 | 0.955 | 0.980 | 0.960 | 1.000 | 0.008 |
| id | 0.010 | 0.007 | 0.009 | 0.000 | 0.008 | 0.009 | 0.009 | 0.008 | 1.000 |
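A matrix like the ones above can be approximated with pandas. This is a minimal sketch, not the report's own method: the report does not say which coefficient it uses, so Spearman is an assumption here, and the helper name `numeric_correlations` is hypothetical. The toy frame reuses the first four train sample rows from this report, and the categorical `Sex` column is left out because `DataFrame.corr` only handles numeric data.

```python
import pandas as pd


def numeric_correlations(df: pd.DataFrame, method: str = "spearman") -> pd.DataFrame:
    """Pairwise correlation of the numeric columns, rounded to 3 decimals.

    The choice of Spearman is an assumption; the report does not name
    the coefficient behind its correlation tables.
    """
    return df.select_dtypes("number").corr(method=method).round(3)


# First four train sample rows from this report (a few columns only).
toy = pd.DataFrame(
    {
        "Length": [0.550, 0.630, 0.160, 0.595],
        "Diameter": [0.430, 0.490, 0.110, 0.475],
        "Rings": [11, 11, 6, 10],
    }
)

corr = numeric_correlations(toy)
print(corr)
```

On four rows the coefficients are not meaningful; run the helper on the full train and test frames to compare against the tables above.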
Missing values
Train
A simple visualization of nullity by column.
Test
A simple visualization of nullity by column.
Train
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
Test
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
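The per-column nullity counts behind these plots can be reproduced with a few lines of pandas. The sketch below is illustrative only: `nullity_by_column` is a hypothetical helper, and the toy frame contains deliberately injected gaps, whereas the overview reports 0 missing cells for both Train and Test.

```python
import numpy as np
import pandas as pd


def nullity_by_column(df: pd.DataFrame) -> pd.DataFrame:
    """Count missing cells per column and express them as a percentage."""
    missing = df.isna().sum()
    return pd.DataFrame(
        {
            "missing": missing,
            "missing_pct": (100 * missing / len(df)).round(1),
        }
    )


# Hypothetical frame with injected gaps (the real datasets have none).
toy = pd.DataFrame(
    {
        "Sex": ["F", "F", None, "M"],
        "Height": [0.150, np.nan, 0.025, 0.150],
        "Rings": [11, 11, 6, 10],
    }
)

summary = nullity_by_column(toy)
total_missing = int(summary["missing"].sum())
print(summary)
```

Applied to the actual Train and Test frames, every `missing` value should be 0, matching the "Missing cells" row of the dataset statistics.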
Sample
Train (first rows)
| | id | Sex | Length | Diameter | Height | Whole weight | Whole weight.1 | Whole weight.2 | Shell weight | Rings |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | F | 0.550 | 0.430 | 0.150 | 0.7715 | 0.3285 | 0.1465 | 0.2400 | 11 |
| 1 | 1 | F | 0.630 | 0.490 | 0.145 | 1.1300 | 0.4580 | 0.2765 | 0.3200 | 11 |
| 2 | 2 | I | 0.160 | 0.110 | 0.025 | 0.0210 | 0.0055 | 0.0030 | 0.0050 | 6 |
| 3 | 3 | M | 0.595 | 0.475 | 0.150 | 0.9145 | 0.3755 | 0.2055 | 0.2500 | 10 |
| 4 | 4 | I | 0.555 | 0.425 | 0.130 | 0.7820 | 0.3695 | 0.1600 | 0.1975 | 9 |
| 5 | 5 | F | 0.610 | 0.480 | 0.170 | 1.2010 | 0.5335 | 0.3135 | 0.3085 | 10 |
| 6 | 6 | M | 0.415 | 0.325 | 0.110 | 0.3315 | 0.1655 | 0.0715 | 0.1300 | 9 |
| 7 | 7 | F | 0.610 | 0.490 | 0.150 | 1.1165 | 0.4955 | 0.2945 | 0.2950 | 9 |
| 8 | 8 | I | 0.205 | 0.150 | 0.040 | 0.0460 | 0.0145 | 0.0105 | 0.0100 | 4 |
| 9 | 9 | I | 0.565 | 0.425 | 0.125 | 0.6510 | 0.3795 | 0.1420 | 0.1800 | 8 |
Test (first rows)
| | id | Sex | Length | Diameter | Height | Whole weight | Whole weight.1 | Whole weight.2 | Shell weight |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 90615 | M | 0.645 | 0.475 | 0.155 | 1.2380 | 0.6185 | 0.3125 | 0.3005 |
| 1 | 90616 | M | 0.580 | 0.460 | 0.160 | 0.9830 | 0.4785 | 0.2195 | 0.2750 |
| 2 | 90617 | M | 0.560 | 0.420 | 0.140 | 0.8395 | 0.3525 | 0.1845 | 0.2405 |
| 3 | 90618 | M | 0.570 | 0.490 | 0.145 | 0.8740 | 0.3525 | 0.1865 | 0.2350 |
| 4 | 90619 | I | 0.415 | 0.325 | 0.110 | 0.3580 | 0.1575 | 0.0670 | 0.1050 |
| 5 | 90620 | M | 0.560 | 0.425 | 0.140 | 0.8105 | 0.3525 | 0.1915 | 0.2150 |
| 6 | 90621 | M | 0.635 | 0.490 | 0.170 | 1.1835 | 0.4605 | 0.2445 | 0.3550 |
| 7 | 90622 | I | 0.340 | 0.250 | 0.075 | 0.1675 | 0.0750 | 0.0330 | 0.0480 |
| 8 | 90623 | I | 0.485 | 0.370 | 0.110 | 0.5360 | 0.2565 | 0.0980 | 0.1490 |
| 9 | 90624 | F | 0.640 | 0.500 | 0.195 | 1.3380 | 0.6470 | 0.3175 | 0.3965 |
Train (last rows)
| | id | Sex | Length | Diameter | Height | Whole weight | Whole weight.1 | Whole weight.2 | Shell weight | Rings |
|---|---|---|---|---|---|---|---|---|---|---|
| 90605 | 90605 | M | 0.560 | 0.450 | 0.155 | 0.9055 | 0.3925 | 0.1775 | 0.2800 | 9 |
| 90606 | 90606 | M | 0.575 | 0.450 | 0.165 | 1.0985 | 0.3765 | 0.2150 | 0.4000 | 14 |
| 90607 | 90607 | F | 0.555 | 0.425 | 0.155 | 0.8790 | 0.3410 | 0.2065 | 0.2500 | 10 |
| 90608 | 90608 | I | 0.350 | 0.265 | 0.075 | 0.1735 | 0.0760 | 0.0590 | 0.0525 | 6 |
| 90609 | 90609 | F | 0.650 | 0.525 | 0.185 | 1.7070 | 0.6605 | 0.3545 | 0.4735 | 14 |
| 90610 | 90610 | M | 0.335 | 0.235 | 0.075 | 0.1585 | 0.0685 | 0.0370 | 0.0450 | 6 |
| 90611 | 90611 | M | 0.555 | 0.425 | 0.150 | 0.8790 | 0.3865 | 0.1815 | 0.2400 | 9 |
| 90612 | 90612 | I | 0.435 | 0.330 | 0.095 | 0.3215 | 0.1510 | 0.0785 | 0.0815 | 6 |
| 90613 | 90613 | I | 0.345 | 0.270 | 0.075 | 0.2000 | 0.0980 | 0.0490 | 0.0700 | 6 |
| 90614 | 90614 | I | 0.425 | 0.325 | 0.100 | 0.3455 | 0.1525 | 0.0785 | 0.1050 | 8 |
Test (last rows)
| | id | Sex | Length | Diameter | Height | Whole weight | Whole weight.1 | Whole weight.2 | Shell weight |
|---|---|---|---|---|---|---|---|---|---|
| 60401 | 151016 | F | 0.585 | 0.455 | 0.155 | 0.9125 | 0.3125 | 0.1935 | 0.3200 |
| 60402 | 151017 | I | 0.400 | 0.315 | 0.095 | 0.2645 | 0.1150 | 0.0530 | 0.0740 |
| 60403 | 151018 | F | 0.605 | 0.475 | 0.145 | 0.9740 | 0.4305 | 0.2300 | 0.3150 |
| 60404 | 151019 | I | 0.560 | 0.430 | 0.130 | 0.7650 | 0.3065 | 0.1740 | 0.2565 |
| 60405 | 151020 | M | 0.570 | 0.435 | 0.125 | 0.9265 | 0.3685 | 0.2015 | 0.2950 |
| 60406 | 151021 | I | 0.345 | 0.260 | 0.085 | 0.1775 | 0.0735 | 0.0265 | 0.0500 |
| 60407 | 151022 | F | 0.525 | 0.410 | 0.145 | 0.8445 | 0.3885 | 0.1670 | 0.2050 |
| 60408 | 151023 | I | 0.590 | 0.440 | 0.155 | 1.1220 | 0.3930 | 0.2000 | 0.2650 |
| 60409 | 151024 | F | 0.660 | 0.525 | 0.190 | 1.4935 | 0.5885 | 0.3575 | 0.4350 |
| 60410 | 151025 | F | 0.430 | 0.340 | 0.120 | 0.4150 | 0.1525 | 0.0910 | 0.0905 |
Duplicate rows
Train
Dataset does not contain duplicate rows.
Test
Dataset does not contain duplicate rows.
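Because `id` has unique values in both datasets, no two full rows can be identical, so a zero duplicate count is expected whenever `id` is part of the comparison. A rough sketch of checking this with pandas follows, with and without the `id` column. The helper name `count_duplicate_rows` and the toy frame are hypothetical, and the report does not say whether duplicates appear once `id` is dropped.

```python
import pandas as pd


def count_duplicate_rows(df: pd.DataFrame) -> int:
    """Number of rows that repeat an earlier row exactly."""
    return int(df.duplicated().sum())


# Hypothetical frame: rows 0 and 2 share the same measurements but not the id.
toy = pd.DataFrame(
    {
        "id": [0, 1, 2],
        "Sex": ["F", "M", "F"],
        "Length": [0.550, 0.595, 0.550],
    }
)

with_id = count_duplicate_rows(toy)
without_id = count_duplicate_rows(toy.drop(columns="id"))
print(with_id, without_id)
```

Running the second form on the real Train and Test frames would show whether any measurements repeat once the identifier is excluded.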